Back

Journal of Sleep Research

Wiley

Preprints posted in the last 90 days, ranked by how well they match Journal of Sleep Research's content profile, based on 31 papers previously published here. The average preprint has a 0.02% match score for this journal, so anything above that is already an above-average fit.

1
Multidimensional Sleep Health and Cognitive Risk in Midlife Primary Care: Comparing Questionnaires

Kim, M.; Bonham, M.; Yeh, F.; Rogers, L.; Ho, E. H.; Curtis, L.; Benavente, J. Y.; Bailey, S. C.; Linder, J. A.; Wolf, M. S.; Zee, P. C.

2026-04-17 primary care research 10.64898/2026.04.15.26350952 medRxiv
Top 0.1%
81.0%
Show abstract

ImportanceSleep-wake disturbances in midlife are common and potentially modifiable contributors to long-term brain health, yet primary care lacks a brief, validated tool that reliably identifies adults with early cognitive vulnerability. ObjectiveTo evaluate associations between commonly used sleep questionnaires and cognitive impairment among midlife primary care patients. Design, Setting, and ParticipantsCross-sectional analysis of baseline data from the MidCog cohort, an observational study of English-speaking adults aged 35 to 64 years receiving primary care at academic practices or federally qualified health centers in the Chicagoland area. ExposuresFive validated sleep questionnaires were used to assess distinct sleep-wake disturbance phenotypes: (A) unsatisfactory sleep (PROMIS Sleep Disturbance T-score >55), (B) short sleep duration (<6 hours; Munich Chronotype Questionnaire), (C) obstructive sleep apnea (OSA) risk (STOP-Bang [&ge;]3), (D) insomnia symptoms (Insomnia Severity Index [&ge;]15), and (E) poor multidimensional sleep health (RU-SATED [&le;]6). Main Outcomes and MeasuresThe primary outcome was cognitive impairment defined as an age- and education-adjusted NIH Toolbox Cognition Battery (NIHTB-CB) Fluid Composite T-score <40 (>1 SD below the population mean). Cognitive impairment defined by the Montreal Cognitive Assessment (MoCA) score <23 served as the secondary outcome. Logistic regression estimated adjusted odds ratios (aOR), controlling for age, sex, education, body mass index, hypertension, hypercholesterolemia, diabetes, smoking, depressive symptoms, and recruitment site. ResultsAmong 646 participants (mean [SD] age, 52.3 [8.1] years; 62.4% female; 38.0% non-Hispanic Black, 38.4% non-Hispanic White, 16.0% Hispanic), cognitive impairment was present in 18.7% by NIHTB-CB and 22.3% by MoCA. Among five sleep-wake disturbance phenotypes evaluated, only poor multidimensional sleep health was consistently associated with cognitive impairment after multivariable adjustment (NIHTB-CB: adjusted OR [95% CI] = 2.03 [1.25-3.26]; MoCA: 1.98 [1.20-3.26]). Conclusions and RelevancePoor multidimensional sleep health was associated with cognitive impairment in midlife primary care patients. Brief multidimensional sleep health screening may identify individuals with early cognitive vulnerability and represent a potential strategy for targeting sleep-focused interventions to promote long-term brain health. Key PointsO_ST_ABSQuestionC_ST_ABSAmong commonly used brief sleep questionnaires, which measure, if any, best identifies midlife primary care patients at risk of early cognitive vulnerability? FindingsIn this cross-sectional study of 646 primary care patients aged 35-64 years, poor multidimensional sleep health assessed using the RU-SATED questionnaire was the only sleep-wake disturbance phenotype consistently associated with cognitive impairment across two cognitive measures (NIH Toolbox Cognitive Battery and Montreal Cognitive Assessment). MeaningBrief multidimensional sleep health screening may help identify midlife adults with sleep-related early cognitive vulnerability in primary care and may represent a potential target for sleep-focused interventions to promote long-term brain health.

2
Disentangling Fatigue from Depression among Survivors of Severe COVID-19

Cabrera, J. R.; Pham, P.; Boscardin, W. J.; Makam, A. N.

2026-04-27 primary care research 10.64898/2026.04.24.26351694 medRxiv
Top 0.1%
29.0%
Show abstract

PurposeSurvivors of severe COVID-19 commonly experience post-intensive care syndrome (PICS), which includes depression and fatigue. Fatigue is far more common and may inflate depression severity given overlapping symptoms. We sought to disentangle fatigue from depression in PICS. MethodsWe conducted a cross-sectional analysis of the RAFT COVID study, a national multicenter longitudinal cohort of severe prolonged COVID-19 survivors. We included participants who completed validated surveys at 1-year from hospitalization for depression (PHQ-9) and fatigue (FACIT-Fatigue). We described correlation of FACIT-fatigue with the PHQ9, and separately with PHQ-2 and PHQ-7, which both omit the two items we hypothesized are influenced by fatigue--tiredness and sleeping. Using a MIMIC model, we performed differential item functioning to evaluate the impact of fatigue on depression directly through these two questions and indirectly with the latent depression construct. We then compared PHQ-7 to PHQ-9 scores by fatigue status. ResultsAmong 82 participants, 61.0% reported fatigue (reverse-scored FACIT-Fatigue [&ge;]9), and 15.9% moderately severe depression (PHQ-9 [&ge;]10). FACIT-fatigue was strongly correlated with PHQ-9 (r=.87, p<.001), but less so for PHQ-2 (r=.76, p<.001) and PHQ-7 (r=.82, p<.001). The MIMIC model identified significant direct effects on tiredness ({lambda}=.89, p<.001) and sleep ({lambda}=.52, p<.001). Among fatigued participants, the rescaled PHQ-7 was lower than the PHQ-9 (median of 4.5, IQR 1.50-9.75, vs 7, IQR 4-9.75). ConclusionsFatigue significantly inflated depression symptoms in severe COVID-19 survivors through tiredness and sleeping PHQ-9 items. PHQ-2 may better screen for true depressive symptoms in PICS, minimizing the risk of misdiagnosis and overtreatment. PLAIN ENGLISH SUMMARYSurvivors of severe COVID-19 illness commonly experience post-intensive care syndrome (PICS), which includes depression and fatigue. Fatigue is far more common and may inflate depression severity given overlapping symptoms. We sought to disentangle fatigue from depression in PICS. We found that the presence of fatigue inflated depression severity through symptoms of tiredness and difficulty sleeping, which are two of the nine items of a commonly used depression screening tool, known as the Patient Health Questionnaire-9 (PHQ-9). Depression screening tools that omit these two items, such as the PHQ-2, may better screen for depressive symptoms in PICS, minimizing the risk of overestimating depression symptoms and potentially misdiagnosis.

3
Multimodal sleep stage classification and label-free abnormality scoring in mid-to-older adults

Nur, Z.; Bijlani, N.; Villarroel, M.

2026-06-05 health informatics 10.64898/2026.05.28.26353980 medRxiv
Top 0.1%
23.0%
Show abstract

Background: Sleep fragmentation and reduced sleep efficiency are markers of disrupted sleep architecture linked to cognitive and age-related decline. Current assessments rely on subjective reports prone to recall bias, limiting their effectiveness for longitudinal monitoring. Data-driven analysis of sleep using physiological signals such as EEG and EMG remains underutilised, particularly in mid-to-older adults. Objective: We present a deep learning pipeline for automated sleep staging and label-free abnormality scoring, with the primary objective of quantifying deviations in sleep architecture to capture progressive sleep disruption and longitudinal change. Methods: Temporal and attention-based models were benchmarked using datasets from the National Sleep Research Resource and PhysioBank. To improve class-specific performance, we introduce a stacking-based ensemble of sleep stage classifiers, each trained to specialise in a different stage. For longitudinal scoring, we develop a reconstruction loss-based abnormality metric using a temporal convolutional autoencoder trained on hypnograms generated by the sleep staging models. Results: Attention-based models, particularly AttnSleep, achieved the highest performance in both multimodal and single-channel settings (accuracy: 0.85 and 0.83; F1: 0.79 and 0.74, respectively). The encoder-decoder ensemble model improved overall classification accuracy by 3% compared to the best-performing biased base classifier, with a modest gain in N1-stage F1 score (0.444). The proposed abnormality score correlated with Pittsburgh Sleep Quality Index components and showed sensitivity to synthetic hypnogram degradation, highlighting its potential as a label-free indicator of sleep disruption. Conclusion: Automated classification and annotation-free scoring enable an end-to-end multimodal pipeline that supports scalable, objective sleep health monitoring, with relevance for future clinical deployment.

4
Narcolepsy Revolution - Protocol and Methodology A diagnostic accuracy study protocol using the Dreem 3 headband for ambulatory diagnosis of narcolepsy in children and young adults

Rossor, T.; Rush, C.; Senior, E.; Birdseye, A.; Piantino, C.; Perez Carbonell, L.; Leschziner, G.; Bartsch, U.; Gringras, P.

2026-03-27 health systems and quality improvement 10.64898/2026.03.25.26349319 medRxiv
Top 0.1%
22.8%
Show abstract

Background Narcolepsy is a rare, lifelong neurological disorder that often begins in childhood or adolescence. Diagnosis is frequently delayed because current diagnostic testing relies on specialist in-patient sleep investigations: overnight polysomnography (PSG) followed by a multiple sleep latency test (MSLT), interpreted according to International Classification of Sleep Disorders criteria (ICSD-3-TR). These investigations are expensive, labour intensive, and available in a limited number of centres, contributing to delays and inequity of access. Automated analysis of sleep-stage probabilities (hypnodensity) using neural networks has shown promising diagnostic performance in research cohorts but still requires hospital-based PSG acquisition. The Dreem 3 headband (DH) is a comfortable, dry-montage EEG device designed for home use. Combined with its proprietary machine learning classification of sleep stages, it may offer accurate ambulatory sleep physiology assessments and support clinical decision making. Methods This was a single-centre, prospective, observational study recruiting 60 participants aged 10 to 35 years undergoing investigation for hypersomnolence within GSTT sleep services and scheduled for PSG and MSLT as part of routine care. Exclusion criteria included physician-diagnosed medical or psychiatric disorder that could independently account for excessive daytime sleepiness; and/ or regular use of prescribed or recreational medication known to affect sleep architecture. Participants first wore the DH at home for five weeknights, followed by a continuous 48-hour weekend recording using two devices in rotation. They then underwent routine in-patient PSG and MSLT. PSG and MSLT were interpreted according to ICSD-3 by an experienced sleep physician and a final diagnosis determined by a sleep physiology consultant. The primary outcome is accuracy of ambulatory DH-based assessment of sleep physiology and subsequent diagnosis of sleep disorders. We evaluate proprietary and in-house developed machine learning methods to detect SOREM epochs and predict narcolepsy diagnosis from PSG, PSG+MSLT and DH data. All algorithmic outcomes will be compared to clinical outcomes derived from current clinical standard of care. Discussion This study will provide proof-of-concept evidence for a home-based wearable EEG approach to narcolepsy diagnosis. Patient and public involvement work with young people with confirmed narcolepsy indicates high acceptability of the DH protocol: in a survey of ten young people, eight reported they would be willing to wear a sleep headband nightly at home for five days (two were unsure), and seven reported they would be willing to wear it continuously for 48 hours over a weekend (two were unsure; one said no). These findings informed the decision to restrict continuous wear to the weekend, reflecting feedback that daytime wear during school or work hours would be unacceptable. If validated, this approach could reduce delays to diagnosis, improve equity of access, and support development of a subsequent multicentre study. Trial registration IRAS Project ID: 321547. Registered October 2022. Recruitment was completed on 30 January 2026.

5
The effect of physical activity timing on insomnia and sleep quality: a randomized cross-over trial in older adults

Albalak, G.; Noordam, R.; van der Elst, M.; Drop, T.; Caneda Cabrera, E.; Oudendijk, L.; Lammers, G. J.; Gordijn, M.; Kervezee, L.; Exadaktylos, V.; van Bodegom, D.; van Heemst, D.

2026-05-20 geriatric medicine 10.64898/2026.05.18.26353463 medRxiv
Top 0.1%
19.2%
Show abstract

Background Insomnia symptoms are common in older adults. While observational studies suggest physical activity (PA) timing affects health outcomes, its effect on sleep remains unclear. We compared morning versus evening PA effects on insomnia severity and sleep quality in older adults with insomnia symptoms. Methods Eligible participants were aged 60 to 80 years with (sub)clinical insomnia (Insomnia Severity Index [ISI] score [&ge;]10). In a randomized cross-over trial, participants engaged in coached PA in the morning (10:00 - 11:00) or evening (19:30 - 20:30) for 14 days each. ISI scores were assessed post-intervention. Objective sleep parameters; duration, latency, efficiency, and timing, were assessed with a Withings Sleep Analyzer under the mattress. Subjective sleep quality was reported daily via smartphone app. Salivary dim light melatonin onset (DLMO) was measured on the final day of each intervention. Results Of 37 participants (mean ISI 14.3 {+/-} 3.3), 27 completed the study (mean age 69.8 {+/-} 5; 63% women). ISI scores improved after both morning ({Delta} - 2.5; 95% CI: - 1.14, - 3.83) and evening ({Delta} - 2.0; 95% CI: - 0.63, - 3.38) activity relative to baseline, but were not different between interventions. Compared to evening activity, sleep midpoint occurred earlier with morning activity (03:40 vs 04:00; {Delta} - 20 min; 95% CI: - 31, - 8). No differences in subjective sleep quality or DLMO were found. Exploratory analyses suggested insomnia scores improved specifically in late chronotypes following morning activity. Conclusions While morning vs. evening PA timing did not impact most sleep quality measures, it influenced sleep timing. Larger studies are needed to define optimal and personalized PA timing for improving sleep.

6
Are different consumer sleep technologies measuring the same essential aspects of sleep?

G Ravindran, K. K.; della Monica, C.; Atzori, G.; M Pineda, M.; Nilforooshan, R.; Hassanin, H.; Revell, V. L.; Dijk, D.-J.

2026-04-01 public and global health 10.64898/2026.03.31.26349815 medRxiv
Top 0.1%
18.5%
Show abstract

Study objectives Consumer sleep technologies (CSTs) enable low-burden longitudinal sleep monitoring, and their output measures are often interpreted as equivalent to polysomnography (PSG) measures. We applied a measurement reliability-aware approach to determine whether CST-derived 'sleep' measures (1) are interchangeable or device-specific, (2) can reliably assess trait-like sleep characteristics of an individual, (3) can be reduced to latent principal components of sleep, and (4) can be used for classification and biomarker discovery. Methods Data from 74 older adults (20 people living with dementia [PLWD]) were collected at-home (upto 14 nights; Total=752nights) using four tools simultaneously: research-grade actigraphy (Axivity), a wearable (Withings Watch), a nearable (Withings Sleep Analyzer) and Sleep Diary, followed by one in-lab PSG assessment. We used repeated-measures correlation analyses, intraclass correlation coefficients (ICC), principal component analysis (PCA) and binary classification models to address our objectives. Results Single-night between-device correlations and correlations with PSG were moderate (0.3[&le;]r<0.7) for some duration- and timing-related measures, but other associations were weak (r<0.3). Seventy-one percent of sleep measures reached acceptable reliability (ICC[&ge;]0.7) within seven nights of aggregation, but the required aggregation window varied across measures, tools and between PLWD and Controls. Reliability-filtered PCA yielded stable and interpretable principal components, but Duration was the only component showing moderate between-device association. Principal components were successfully used to classify PLWD vs Controls but feature importance varied across devices. Conclusions Aggregation of CST derived measures across 7-14 nights, yielded reliable measures, most of which were device-specific, with duration being the only essential aspect transferable between devices.

7
Validity and Limitations of the Empatica E4 Wristband for Autonomic and Thermoregulatory Sleep Monitoring Against Concurrent Polysomnography: A Wearanize+ Dataset Study

Parry, Y. D.; Briganti, G.

2026-06-11 health informatics 10.64898/2026.06.10.26355348 medRxiv
Top 0.1%
18.5%
Show abstract

The Empatica E4 wristband provides continuous multi-modal physiological monitoring including blood volume pulse (BVP), electrodermal activity (EDA) and skin temperature (TEMP) but its validity for sleep-stage-specific autonomic and thermoregulatory monitoring has not been systematically evaluated against concurrent polysomnography (PSG). Using the Wearanize+ dataset which provides synchronised PSG, Empatica E4, and Zmax EEG recordings from 100 home-recorded participants; a systematic validation of Empatica E4 physiological signals against PSG ground truth across five sleep stages was conducted. Of 100 participants, 92 had Empatica data; 69 met Zmax EEG signal quality criteria and formed the analysis sample. Heart rate (HR) from the pre-computed Empatica HR channel showed valid stage-specific patterns (Wake: 70.9 bpm, N3: 61.2 bpm) and moderate inter-device MeanNN correspondence with PSG ECG (Spearman r=0.35-0.42 across stages). Skin temperature showed the expected thermoregulatory pattern (Wake: 33.92C, N3: 35.48C) and is recommended for downstream analyses. Tonic EDA showed an inverted stage pattern attributable to wrist sweat accumulation during deep sleep, representing a known confound for wrist-worn EDA during sleep. Phasic EDA showed plausible patterns and may be used with caution. These findings establish a validated feature set for Empatica E4 sleep research and directly inform multimodal psychiatric biomarker studies using the Wearanize+ dataset.

8
The Bedtime Trap: Smartphone Use Until Sleep Onset and Its Association With Sleep Quality and Academic Performance Among Medical Students in Punjab, Pakistan: A Cross-Sectional Survey

Sajjad, M.

2026-06-02 health informatics 10.64898/2026.05.30.26354530 medRxiv
Top 0.1%
14.6%
Show abstract

Smartphone use among medical students has become pervasive. While existing literature links excessive smartphone use to poor sleep quality, the specific behavioral pattern most strongly associated with sleep disruption remains insufficiently characterized. This study investigated whether the timing of smartphone cessation relative to sleep onset is more strongly associated with poor sleep quality than total daily screen time among medical students in Punjab, Pakistan, and examined the moderating role of exam period status. A cross-sectional anonymous online survey was conducted among medical students across Punjab, Pakistan (May 2026). Sleep quality was assessed using items informed by Pittsburgh Sleep Quality Index (PSQI) response formats. Descriptive statistics, chi-square tests, and binary logistic regression were applied to 369 eligible responses, reported in accordance with STROBE guidelines. Of 369 respondents (49.9% female, 48.2% male), 74.8% reported using smartphones 6 or more hours daily and 61.2% used their smartphone until falling asleep. Overall, 75.7% reported poor sleep quality. Students using smartphones until sleep onset had 95.1% poor sleep quality compared to 44.8% in those who ceased use before sleeping (p<0.001). In logistic regression with both variables entered simultaneously, bedtime use until sleep onset remained independently associated with poor sleep quality (OR 15.3, 95% CI 5.7-41.2, p<0.001), while total daily screen time lost significance (OR 1.8, 95% CI 0.7-4.7, p=0.228). Outside exam periods, 99.0% of students using smartphones until sleep onset reported poor sleep quality versus 24.2% of those who stopped before sleeping, a difference of 74.8 percentage points (p<0.001). During exam periods, no significant association was observed (p=0.075), suggesting exam-related stress may attenuate the bedtime behavior effect. Hostel-dwelling students showed the highest prevalence of bedtime smartphone use, with 79.0% using smartphones until sleep onset compared to 23.2% of family-living students (p<0.001). Bedtime smartphone use until sleep onset is more strongly associated with poor sleep quality than total daily screen time among Pakistani medical students. Medical institutions should consider integrating targeted digital wellness education specifically addressing bedtime cessation timing into student health programs, with particular attention to hostel-dwelling students.

9
Effectiveness of Lesser Known Herbal Sedatives for Insomnia: A Systematic Review and Meta-Analysis

Paracha, M. A.; Khan, S. A. J.; Zarkaish, R.; Fazal, F.; Khan, M. D.; Ahmad, M.

2026-03-25 public and global health 10.64898/2026.03.23.26349099 medRxiv
Top 0.1%
14.5%
Show abstract

Abstract Background Insomnia is a major public health problem affecting an estimated 852 million adults worldwide. Current pharmacological treatments, including benzodiazepines and Z-drugs, carry serious risks of dependency, cognitive impairment, and adverse events. These limitations have driven growing interest in complementary and alternative therapies, particularly herbal sedatives, which are perceived as natural and safer. However, evidence on their safety and efficacy remains insufficient and patchy. Objective: This review evaluated the effectiveness of lesser known herbal sedatives for insomnia. Methods The protocol was registered with PROSPERO (CRD420251101795). Eligibility was defined using the PICO framework: Population: adults aged [&ge;]18 years with insomnia; Interventions: Passiflora incarnata, Hawthorn, Melissa officinalis, Chamomile, Viola odorata, Nelumbo nucifera, Rhodiola rosea, and Eschscholtzia californica. Comparators: placebo or usual care; Primary and Secondary Outcomes: sleep quality (Pittsburgh Sleep Quality Index, Insomnia Severity Index, Epworth Sleepiness Scale), sleep duration, and sleep latency. Databases and registers were searched from January 2005 to July 2025. Randomized controlled trials, nonrandomized controlled trials, clinical trials, and observational studies were included. Five reviewers independently screened studies. Data extraction used a structured Excel spreadsheet. Risk of bias was assessed using RoB 2.0 for randomized trials and ROBINS-I V2 for nonrandomized studies. Random-effects meta-analyses (DerSimonian and Laird) were conducted in RevMan. Narrative synthesis followed SWiM guidelines. Results From 1,294 records, 32 studies met eligibility criteria. Meta-analysis of 23 RCTs demonstrated a statistically significant pooled effect favouring herbal sedatives (SMD -0.77, 95% CI -1.14 to -0.40, p=0.0001), with substantial heterogeneity (I square=92%). Subgroup analysis showed larger effects for chamomile (SMD -1.06) and Melissa officinalis (SMD -0.66). Most RCTs had high overall risk of bias; nonrandomized studies predominantly had critical risk of bias. Conclusions This systematic review provides preliminary evidence that several herbal sedatives, particularly chamomile and Melissa officinalis, may improve insomnia-related outcomes. However, methodological weaknesses, high risk of bias, and substantial heterogeneity limit evidence strength. Future research requires standardized extracts, large multicentre RCTs, and extended follow-up.

10
Wearable sleep staging using photoplethysmography and accelerometry across sleep apnea severity: a focus on very severe sleep apnea

Ogaki, S.; Kaneda, M.; Nohara, T.; Fujita, S.; Osako, N.; Yagi, T.; Tomita, Y.; Ogata, T.

2026-04-13 health informatics 10.64898/2026.04.09.26350266 medRxiv
Top 0.1%
12.4%
Show abstract

Study ObjectivesTo evaluate wearable sleep staging across sleep apnea severity, including very severe sleep apnea defined as an apnea-hypopnea index (AHI)[&ge;] 50 events/h, and to assess how training-set composition affects performance in this subgroup. MethodsWe analyzed 552 overnight recordings, 318 from the Sleep Lab Dataset and 234 from the Hospital Dataset. In the Hospital Dataset, 26.5% had very severe sleep apnea. We developed a deep learning model for sleep staging using RR intervals from wrist-worn photoplethysmography and three-axis accelerometry. Baseline performance was assessed by cross-validation under 5-stage and 4-stage staging. We examined night-level associations with AHI severity. We also compared the baseline model with an ablation model trained on the same number of recordings but with more Sleep Lab Dataset and lower-AHI Hospital Dataset recordings, evaluating both models in the very severe subgroup. ResultsIn 5-stage classification, Cohens kappa was 0.586 in the Sleep Lab Dataset and 0.446 in the Hospital Dataset. Under 4-stage staging, the gap narrowed, with kappa values of 0.632 and 0.525, respectively. In the Hospital Dataset, performance declined with increasing AHI severity. Among 62 recordings with very severe sleep apnea, reducing high-AHI representation in training lowered kappa from 0.365 to 0.303. ConclusionsWearable sleep staging performance declined across greater sleep apnea severity in this clinical cohort. Clinical utility may benefit from training data that better represent the target severity spectrum and from selecting staging granularity to match the intended use case. Statement of SignificanceRepeated laboratory polysomnography is impractical for long-term sleep apnea management. Wearable sleep staging could support scalable monitoring, yet its reliability in clinically severe sleep apnea has remained unclear. This study developed and evaluated a wearable sleep staging approach in both sleep-laboratory and hospital cohorts. The hospital cohort included many severe and very severe cases. Performance was lower in the hospital cohort and declined with greater sleep apnea severity. A coarser staging scheme reduced the gap between cohorts, and models trained without representative very severe cases performed worse in this target population. These findings highlight the value of severity-aware model development and motivate future multi-night home validation with reliability cues.

11
Resolving the Deep Sleep Dual Indeterminacy Problem: Context-Dependent Slow-Wave Activity Modeling Predicts Neurobehavioral Fatigue Where Clinical Sleep Modeling Fails

Vattikuti, S.; Xie, H.; Chow, C. C.; Balkin, T. J.; Hughes, J. D.

2026-03-28 physiology 10.64898/2026.03.25.714331 medRxiv
Top 0.1%
10.1%
Show abstract

Deep sleep is widely considered to be the most recuperative component of sleep restoration. Accordingly, a positive relationship between naturally occurring deep sleep and function (e.g., cognitive performance) is often assumed. However, this assumption warrants closer examination--particularly given the rise of sleep tracking that emphasizes traditional sleep metrics and their implied predictive value. We present evidence that while clinical deep sleep scoring provides no predictive value, slow-wave activity (SWA) exhibits a paradoxical association with both improved and worsened neurobehavioral fatigue following sleep deprivation. Specifically, we found that SWA-based models account for approximately 50-60% of the inter-individual variance in recovery from sleep deprivation. Remarkably, when regressed against recovery from sleep deprivation, SWA during the baseline sleep night showed a negative association (normalized {beta} = (-)0.5, p = 0.001) while in the same model SWA during the subsequent wakefulness period showed an opposite positive association (normalized {beta} = 0.5, p = 0.001). Furthermore, although the group-averaged SWA while behaviorally awake increased with impairment across the sleep deprivation period, individual-level data revealed an inverse relationship: individuals more resilient to sleep deprivation exhibited greater SWA in-between mental test sessions and less corresponding impairment during wakefulness suggestive of a protective effect. These findings identify a Deep Sleep Dual Indeterminacy Problem -- simultaneous measurement and causal indeterminacy -- that explains why clinical sleep staging fails as a functional biomarker across a wide range of outcomes, and provide a principled framework for next-generation sleep metrics grounded in continuous electrophysiology and temporal modeling.

12
Sleep Spindle-Locked Targeted Memory Reactivation Enhances Declarative Memory Consolidation

Mutreja, V.; Gupta, P.; Lungu, O.; Lazzouni, L.; Gabitov, E.; Benali, H.; Jourde, H.; Beltrame, G.; Coffey, E. B.; Lina, J.-M.; Albouy, G.; King, B.; Boutin, A.; Carrier, J.; Doyon, J.

2026-05-12 neuroscience 10.64898/2026.05.08.723823 medRxiv
Top 0.1%
9.9%
Show abstract

Study ObjectivesSleep spindles are implicated in memory consolidation. Yet direct evidence linking spindle dynamics to declarative memory outcomes remains limited. We thus tested whether targeted memory reactivation (TMR) time-locked to sleep spindles enhances declarative memory, and whether the temporal organization of stimulated spindles-trains versus isolated events-is selectively associated with distinct memory outcomes. MethodsTwenty-eight healthy young adults learned image locations from two categories (animals, clothing) in a grid, each paired with a distinct auditory cue. During overnight NREM sleep, one cue was replayed time-locked to spindles detected in real-time using a closed-loop system (TMR condition); the other served as the non-reactivated control (No-TMR condition). Category-cue assignment was counterbalanced. Post-sleep recall, recognition accuracy, and movement time were assessed. ResultsRecall accuracy was significantly higher in the TMR than the No-TMR condition (93.96% vs. 90.61%, p = .024), whereas recognition accuracy (p = .139) and movement time (p = .651) did not differ. Stimulation intensity within spindle trains correlated with the TMR effect on recall (Spearman {rho} = .531, p = .004), whereas the proportion of isolated spindle stimulations correlated with the TMR effect on recognition ({rho} = .563, p = .002). Cross-associations were not significant. ConclusionsSpindle-locked TMR enhances recall-based declarative memory retention. The selective association between spindle temporal clustering and memory outcomes suggests that train-embedded and isolated spindles support different aspects of memory consolidation, highlighting spindle temporal context as a functionally relevant dimension of sleep-dependent memory processing.

13
Reduced nighttime smartphone use among cohabiting partners: a longitudinal study under the lens of social control of health behaviors theory

Klasson, T. A.; Rod, N. H.; Zucco, A. G.

2026-06-12 epidemiology 10.64898/2026.06.09.26355243 medRxiv
Top 0.1%
9.1%
Show abstract

Objective: We examined the link between cohabitation with a partner and nighttime smartphone use through the social control of health behavior theory. Background: Nighttime smartphone use is a behavioral risk factor for sleep problems. While previous research has predominantly focused on individual-level risks of sleep disturbances, the role of social context remains underexplored. Theoretical frameworks, specifically the Social Control of Health Behavior, suggest that social relationships regulate health-related behaviors; however, it is unclear how far this regulation extends to modern digital behaviors among couples. Method: We analyzed survey data from three waves of the SmartSleep Study (2018, 2020, and 2023; total N = 25,028), including a longitudinal follow-up subset (N = 1,003). We tested multivariate associations between living with a partner, changes in cohabitation status and frequent nighttime smartphone use by fitting generalized linear mixed-effects models. Additionally, we mapped the complex interplay between indicators of social integration, social support, smartphone use, and sleep quality using hierarchical clustering of non-linear correlations. Results: Cohabiting participants had lower odds of frequent nighttime smartphone use compared to those living alone (OR = 0.66; 95% CI: 0.61, 0.72). This lower risk was driven primarily by cohabitation with a partner (OR = 0.49; 95% CI: 0.36, 0.66). Longitudinal analysis supported these findings, showing that sustained cohabitation was associated with less frequent nighttime use (OR = 0.56; 95% CI: 0.38, 0.82). Clustering analysis revealed that indicators of social integration and support clustered with favorable sleep quality. Conclusion: Our findings suggest that the health-protective effects of cohabitation with a partner extend to digital behaviors. Consistent with social control of health behavior theory, the presence of a partner appears to reduce frequent nighttime smartphone use, highlighting the critical importance of considering social context when addressing digital health hygiene and promoting sleep.

14
Beyond Aging, Sex and Insomnia Disorder Shape NREM Brain Oscillations

Walsh, N.; Perrault, A. A.; Cross, N.; Maltezos, A.; Phillips, E.-M.; Barbaux, L.; Weiner, O.; Dyment, C.; Borgetto, F.; Gouin, J.-P.; Dang Vu, T. T.

2026-03-19 neuroscience 10.64898/2026.03.17.712450 medRxiv
Top 0.1%
7.3%
Show abstract

ObjectivesChronic insomnia (INS) is particularly prevalent in older adults and females. Sex-and age-related differences in neurophysiological markers of sleep quality (sleep spindles and slow-wave activity [SWA]) may underlie differential vulnerability to INS. This study investigated the effects of sex and insomnia on spindle and SWA beyond aging, to better understand the mechanistic differences contributing to the higher prevalence of INS in females. MethodsAfter a habituation night, one night of sleep assessed with polysomnography was analyzed in 222 adults (aged 18-82) including 119 INS (71% female) and 103 healthy sleepers (HS; 61% female). Spindle density, slow oscillation (SO) density, relative sigma power and SWA were derived during NREM sleep. Age, group, sex, and group-by-sex interactions were examined, with age as a covariate. ResultsAge, insomnia, and sex each contributed uniquely to NREM oscillatory activity. INS primarily reduced spindle and SO density, while sex accounted for differences in SWA. While SWA was higher in females overall, sex differences were not significant within the INS or HS groups. Female INS reported highest rates of insomnia severity as well as lower sigma power than males in the INS group. Spindle and SO density deficits were also present in female INS relative to female HS, as well as male INS relative to male HS. ConclusionsThe combination of reduced sigma power in females with insomnia relative to their male counterparts, as well as less spindle and SO density compared to female healthy sleepers may contribute to greater insomnia severity in females. Statement of SignificanceInsomnia is a growing public health concern that is more commonly reported in females, yet the neural mechanisms underlying this sex difference remain poorly understood. Our findings suggest that specific markers of sleep quality are disproportionately disrupted in females with insomnia, potentially contributing to greater vulnerability and symptom severity. These results provide new insight into how sex influences the neurophysiology of insomnia disorder and identify oscillatory markers that could serve as targets for personalized interventions. Future research should investigate whether these alterations represent persistent dysfunction or reversible changes, which could advance understanding of the biological basis of insomnia and inform strategies to improve sleep health in at-risk populations.

15
Insights from nine nights of self-applied, low-density sleep EEG during sleep restriction therapy: a proof-of-concept evaluation

Stanyer, E. C.; Le Roux, M.; Sharman, R.; Ribeiro Pereira, S. I.; Davidson, S. M.; Tarassenko, L.; Espie, C. A.; Kyle, S. D.

2026-05-15 psychiatry and clinical psychology 10.64898/2026.05.08.26348885 medRxiv
Top 0.1%
6.9%
Show abstract

Objectives: Self-applied, low-density EEG offers opportunities to examine sleep in the home environment, yet its feasibility during behavioural sleep interventions remains unexplored. This pilot study aimed to evaluate the feasibility and acceptability of a self-applied, low-density EEG device during sleep restriction therapy (SRT) and explore effects on sleep and affect. Methods: Seventeen adults with insomnia and depressive symptoms completed a 2-week baseline and 4 weeks of SRT. The primary outcome was the proportion of expected EEG recordings completed and scoreable. Secondary outcomes included clinical measures, sleep continuity (sleep diary, actigraphy), sleep architecture (low-density EEG for 9 nights), power spectral density, and affect. Data were analysed with linear mixed models. Cohen's d and 95% confidence intervals were reported. Results: Feasibility was demonstrated (92% of expected EEG nights completed). SRT was associated with reductions in insomnia severity, depressive symptoms, negative affect, and increases in positive affect. Robust improvements were observed across treatment in sleep continuity (SOL, WASO, SE) from diary, which were paralleled by actigraphy. EEG revealed reduced TIB, TST, N1, N2, REM sleep, and REM latency during week one. Reductions in EEG-derived TIB and N1 sleep were maintained at night 28. There were no reliable differences for spectral or spindle measures. Conclusions: These findings suggest that self-applied, low-density EEG during SRT is feasible, acceptable, and may capture sleep changes during treatment. They highlight the potential for multi-night monitoring of sleep interventions at home and elucidating mechanisms underlying therapeutic change.

16
The Sleep-Wake Classification Performance of Pediatric-Trained Machine Learning Algorithms for Raw Accelerometer Data

Chen, P.-W.; Cielo, C.; Walsh, O.; Mcdonald, M.; Song, P. X.; Goldstein, C.; Moreno, J. P.; Jansen, E.; Mitchell, J. A.

2026-06-01 pediatrics 10.64898/2026.05.28.26354364 medRxiv
Top 0.1%
6.8%
Show abstract

Introduction: Actigraphy sleep-wake classification methods increasingly seek to leverage raw acceleration data and machine-learning-based classification, but performance evaluation in pediatrics is limited. We trained machine-learning models using pediatric data and compared their sleep-wake classification performance with existing algorithms for children. Methods: Sixty-five children (46% female, ages 5.3 to 17.7 years) completed in-lab overnight polysomnography and wore a GENEActiv device on their non-dominant wrist. The acceleration data were converted into 30-second epochs and aligned with physician-scored sleep-wake data from electroencephalography. Seven machine-learning models were trained using leave-one-subject-out cross-validation. Epoch-by-epoch analyses generated performance metrics (e.g., balanced accuracy [BA]) and discrepancy analyses provided overall sleep duration bias estimates. The combination of highest performance and least bias was used to rank using Euclidean distance scores - where a lower score represents closer to perfect performance and zero bias. For benchmarking, we included GGIR sleep scoring algorithms and an adult trained random forest classifier. Results: Overall, 560.1 hours of polysomnography and actigraphy data were collected (74.4% of epochs were scored as sleep). The pediatric-trained local-global long-short term memory (LSTM) classifier had the most optimal epoch-by-epoch performance (e.g., BA=0.85, sensitivity=0.88, specificity=0.83, ROC-AUC=0.95, and Cohen kappa=0.67). These metrics exceeded that of an adult-trained random forest classifier and GGIR-based algorithms. Discrepancy analyses revealed that overall sleep duration was underestimated by an average of 25 minutes using the LSTM classifier with no proportional bias. Conclusion: We trained seven pediatric sleep-wake classifiers that had strong ability to detect sleep and wake, with the LSTM classifier being most optimal.

17
Sleep initiation difficulties involve weaker neural and physiological sleep transitions, particularly in children with neurodevelopmental conditions

Hacohen, M.; Dinstein, I.; Guendelman, M.

2026-03-18 neuroscience 10.64898/2026.03.14.711131 medRxiv
Top 0.1%
6.3%
Show abstract

The transition from wake to stable sleep is characterized by multiple neural, physiological, and behavioral changes. How these changes may differ in individuals with difficulties falling asleep such as children with neurodevelopmental conditions is poorly understood. Here, we studied sleep initiation in >2000 nights recorded from 186 children who participated in the Simons Sleep Project (SSP). Data included simultaneous, synchronized recordings of actigraphy, electroencephalography (EEG), photoplethysmography (PPG), and skin temperature. We extracted multiple neural, physiological, and behavioral measures that are known to increase/decrease during the sleep initiation period including EEG delta (1-4Hz) power, movement counts, heart rate (HR), and skin temperature. Transitions from 20 minutes before sleep onset to 40 minutes after sleep onset were modeled with a sigmoid function enabling the quantification of transition timing, speed, and magnitude per measure. Individuals with longer sleep onset latencies (SOL) exhibited smaller increases in EEG delta power and skin temperature as well as smaller decreases in HR and activity counts. These findings indicate that difficulties falling asleep are associated with multiple forms of cortical, physiological, and behavioral hyperarousal that can be measured at home with wearable devices. Importantly, transition magnitudes were key to explaining differences in SOL across participants (26% explained variance) in contrast to transition speed or timing within the sleep initiation period (<13% explained variance). Longer SOL and weaker transitions were particularly prominent in children diagnosed with autism and/or attention deficit hyperactivity disorder (ADHD).

18
Wearable-Derived Long-Term Behavioral Patterns and Short-Term Dynamics Associated With Depressive Symptom Severity

Rim, J.; Xu, Q.; Tang, X.; Pinkerton, C.; Guo, Y.; Qu, A.

2026-05-30 public and global health 10.64898/2026.05.27.26354070 medRxiv
Top 0.1%
6.3%
Show abstract

Background Wearable-based studies have largely examined activity and sleep using static summaries or single time windows, potentially missing how chronic patterns and recent behavioral changes jointly relate to depressive symptom severity. We evaluated whether combining long-term habitual behavior with short-term dynamics improves characterization of moderate-to-severe depressive symptoms. Methods We analyzed Fitbit data from All of Us participants with Patient Health Questionnaire-9 (PHQ-9) assessments, defining moderate-to-severe symptoms as PHQ-9 [&ge;] 10 (N=248). Logistic regression evaluated long-term measures (past-year step count and awake time after sleep onset) and short-term dynamics (30-day step decline and 30-day sleep duration variability), adjusting for demographics. Performance was assessed via repeated stratified 10-fold cross-validation. Results Thirty percent of participants (n = 74) had moderate-to-severe depressive symptoms. Higher long-term step count was associated with lower odds of elevated symptoms (OR = 0.75 per 1,000 steps/day), greater awake time after sleep onset with higher odds (OR = 1.27 per 1%), a 30-day step decline with higher odds (OR = 2.70), and greater 30-day sleep variability with higher odds (OR = 1.07 per percentage point). Short-term dynamics provided complementary information beyond long-term measures alone. The combined model achieved the highest discrimination (area under the curve [AUC] = 0.80 vs. 0.73 demographics-only), though findings should be interpreted as exploratory given the modest sample size. Limitations The sample was modest in size (N = 248), PHQ-9 reflects symptom severity rather than clinical diagnosis, causal inference is not possible given the cross-sectional outcome assessment, and Fitbit users may not represent broader populations. Conclusions Long-term behavioral patterns and short-term changes in activity and sleep were associated with depressive symptom severity, supporting wearable-derived measures as potential adjunctive markers in mental health research.

19
Apnea-hypopnea index estimation with wrist-worn photoplethysmography

Fonseca, P.; Ross, M.; van Meulen, F.; Asin, J.; van Gilst, M. M.; Overeem, S.

2026-04-11 health informatics 10.64898/2026.04.08.26350411 medRxiv
Top 0.1%
6.2%
Show abstract

ObjectiveLong term monitoring of obstructive sleep apnea (OSA) severity may be relevant for several clinical applications. We developed a method for estimating the apnea-hypopnea index (AHI) using wrist-worn, reflective photoplethysmography (PPG). ApproachA neural network was developed to detect respiratory events using PPG and PPG-derived sleep stages as input. The development database encompassed retrospective data from three polysomnographic datasets (N=3111), including a dataset with concurrent reflective PPG recordings from a wrist-worn device (N=969). The model was pre-trained with (transmissive) finger-PPG signals from all overnight recordings and then fine-tuned to wrist-PPG characteristics using transfer learning. Validation was performed on the test portion of the development set and on a fourth, external hold-out dataset containing both wrist-PPG and PSG data (N=171). Performance was evaluated in terms of AHI estimation accuracy and OSA severity classification. Main ResultsThe fine-tuned wrist-PPG model demonstrated strong agreement with the PSG-derived gold-standard AHI, achieving intra-class correlation coefficients of 0.87 in the test portion of the development set and 0.91 in the external hold-out validation set. Diagnostic performance was high, with accuracies above 80% for all severity thresholds. SignificanceThe study highlights the potential of reflective PPG-based AHI estimation, achieving high estimation performance in comparison with PSG. These measurements can be performed with relatively comfortable sensors integrated in convenient wrist-worn wearables, enabling long-term assessment of sleep disordered breathing, both in a diagnostic phase, and during therapy follow-up.

20
Sleep physiology in late pregnancy: A video-based, multi-night, in-home, level 3 sleep apnea study of pregnant participants and their bed partners

Kember, A. J.; Ritchie, L.; Zia, H.; Elangainesan, P.; Gilad, N.; Warland, J.; Taati, B.; Dolatabadi, E.; Hobson, S.

2026-04-25 obstetrics and gynecology 10.64898/2026.04.17.26351131 medRxiv
Top 0.1%
5.0%
Show abstract

We completed a video-based, four-night, in-home, level 3 sleep apnea study of healthy, low-risk pregnant participants and their bed partners in order to characterize sleep physiology in the third trimester of pregnancy. Demographic, anthropometric, and baseline sleep health characteristics were recorded, and the NightOwl home sleep apnea test device was used to measure sleep breathing, posture, and architecture parameters. Symptoms of restless legs syndrome were elicited in the exit interview. Forty-one pregnant participants and 36 bed partners completed the study. Bed partners had a significantly higher prevalence of sleep apnea than their pregnant co-sleepers (31% vs. 5.9%). Bed partners also had more severe sleep apnea than their pregnant co-sleepers, and this persisted on an adjusted analysis for baseline differences in factors known to increase risk of sleep apnea. In pregnant participants, increasing gestational age was found to be protective against mild respiratory events but not more severe events. While the correlation between STOP-Bang score and measures of sleep apnea severity was weak, an affirmative response to the "witnessed apneas" item on the STOP-Bang questionnaire was a strong predictor of more severe sleep apnea for all participants. Smoking history also increased sleep apnea risk. Pregnant participants had lower sleep efficiency and longer self-reported sleep onset latency. Restless legs syndrome was experienced by 39.5% of the pregnant participants but no bed partners. From a sleep breathing perspective, people with healthy, low-risk pregnancies have better sleep than their bed partners despite lower sleep efficiency and higher rates of restless legs syndrome. Clinical Study RegistrationSleep in Late Pregnancy - Artificial Intelligence Development for the Detection of Disturbances and Disorders (SLeeP AID4), https://clinicaltrials.gov/study/NCT05376475, registration ID NCT05376475. Statement of SignificancePregnancy negatively impacts sleep, and poor sleep in pregnancy negatively impacts maternal and fetal health. Pregnancy represents a unique challenge to sleep breathing physiology and, thus, an opportunity to test for sleep apnea. Sleep apnea however, while increased in pregnancy, is more common in males. This novel study tested healthy people with low-risk pregnancies and their bed partners for sleep apnea in the comfort of their home over four nights in late pregnancy. Sleep apnea was more common and worse in the bed partners. Advancing gestational age was protective against mild but not severe sleep apnea, and a critical remaining knowledge gap is this interplay in high-risk pregnancies. Future sleep in pregnancy research should make efforts to include high-risk pregnancies.